Using character overlap to improve language transformation

Authors

  • Sander Wubben
  • Emiel Krahmer
  • Antal van den Bosch
Abstract

Language transformation can be defined as translating between diachronically distinct language variants. We investigate the transformation of Middle Dutch into Modern Dutch by means of machine translation. We demonstrate that by using character overlap the performance of the machine translation process can be improved for this task.
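
The abstract does not say how character overlap is measured or how it enters the translation pipeline. As a minimal sketch only, assuming a Dice coefficient over character bigrams (a common overlap measure, not necessarily the authors' choice), the similarity between a historical spelling and a modern one could be scored as below. The function names `char_ngrams` and `char_overlap` are hypothetical.

```python
from collections import Counter


def char_ngrams(word, n=2):
    """Return the multiset of character n-grams in `word`."""
    return Counter(word[i:i + n] for i in range(len(word) - n + 1))


def char_overlap(a, b, n=2):
    """Dice coefficient over character n-grams, in the range [0, 1].

    Assumption: this is one plausible overlap measure, chosen for
    illustration; the source does not specify the measure used.
    """
    grams_a = char_ngrams(a.lower(), n)
    grams_b = char_ngrams(b.lower(), n)
    total = sum(grams_a.values()) + sum(grams_b.values())
    if total == 0:
        # Both strings are shorter than n characters.
        return 0.0
    shared = sum((grams_a & grams_b).values())
    return 2 * shared / total


# Illustrative pair: Middle Dutch "coninc" and Modern Dutch "koning".
score = char_overlap("coninc", "koning")
```

A score like this could, for example, be used to prefer candidate translations that stay orthographically close to the source word, which is plausible for closely related language variants.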

Similar resources

A Linguistic Account of the Protagonist’s Development in the Grapes of Wrath

The novel as a modern literary genre is generally regarded as the realization of its main character's journey from immaturity to a status of maturity. The character, usually an uncomplicated person unable to cope with the complexities of life at first, gains an insight and understanding to handle his/her complex situation accordingly later in the novel. It is usually agreed in both literary cri...

A new model for Persian multi-part words edition based on statistical machine translation

Multi-part words in English are hyphenated, with the hyphen separating the different parts. Persian has multi-part words as well. Under Persian morphology, a half-space character is needed to separate the parts of multi-part words, but in many cases people incorrectly use a space character instead of the half-space character. This common incorrect use of the space leads to some s...

Pygmalion in Conversation with Pierre Bourdieu: A Sociological Perspective

George Bernard Shaw's masterpiece Pygmalion deals with the social function of language and reveals that Linguistic Competence is one of the markers of social status. It presents the story of the social transformation of a flower girl into a ‘lady’ through linguistic retraining. This work has been analyzed from a variety of perspectives such as Freudian psychology and sociolinguistic perspective...

Design and construction of a recombinant construct carrying the interferon beta gene mutated in the Kozak region in order to enhance translation

Background: Interferon beta is one of the most important members of the type I interferons and is the main drug for multiple sclerosis treatment. Interferon beta has a short half-life, which compels patients to take the medication frequently. Given its clinical use, there is a broad effort to improve its translation level and protein production. There are several important factors which affect pr...

Twitter Paraphrase Identification with Simple Overlap Features and SVMs

We present an approach to identifying Twitter paraphrases using simple lexical overlap features. The work is part of ongoing research into the applicability of knowledge-lean techniques to paraphrase identification. We utilize features based on overlap of word and character n-grams and train a support vector machine (SVM). Our results demonstrate that character and word level overlap features in c...

Publication date: 2013